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DISTRIBUTION -FREE  INTERVAL  ESTIMATION  OF 


THE  LARGEST  CuQUANTILE* 
by 

M.  Haseeb  Ri zvi  and  K.  M.  Lai  Saxena 
Stanford  University  and  University  of  Nebraska 

1 .  Introduction  and  Formulation  of  the  Problem 

Ordering  of  several  unknown  parameters  is  a  problem  of  wide  practical 
applications.  Saxena  and  Tong  [}|  and  Saxena  [2  J  have  recently  constructed 
confidence  intervals  for  the  largest  location  and  scale  parameters  res¬ 
pectively.  This  paper  deals  with  the  distribution-free  interval  estima¬ 
tion  of  the  largest  a-quantile  of  several  continuous  distributions. 

Consider  k(>  1)  distributions  with  unknown  continuous  cdfs 
F^,  i=l,...,k,  Let  xa(F^)  denote  the  unique  a-quantile  (0  <  O.  <  l) 
of  Fj  .  If  xQ( F. )  is  not  unique,  it  can  be  defined  to  be  so  in  an 
obvious  manner.  Define  8  -  max  x  ( F. )  .  For  a  specified  constant 

LX  i 

]<  i<k 

y,  a  random  interval  I  is  desired  such  that 

(  1)  inf  P(e  t  I)  >r 

Si  ~ 

where  Si  denotes  the  set  of  ail  possible  k-tuples  (  F^,  . . . , F^)  .. 

Such  an  interval  I,  based  on  order  statistics  of  random  samples  of 
equai  sizes  from  each  is  proposed  below. 

* 

The  second  author's  work  was  supported  in  part  by  University  of 
Nebraska  Research  Council. 
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seed  Procedure  and  Its  Probability  of  Covers 


Independent  random  samples  of  coanon  size  n  sire  taken  from  each 
of  the  k  distributions.  Let  Y  .  denote  the  r^*1  order  statistic 
from  F.  and  Y  =  max  Y  ,  for  r=l, ...,n;  define  Y  =  -o»  and 

l<i<*  r'  ° 


Y  -  +<»  .  For  s  <  t,  consider  the  random  interval  1  =  (Y  ,Y.  ) 

n+1  o  s'  t 

and  assert  that  6  e  IQ,  where  8  and  t  are  chosen  so  os  to 
satisfy  (1) . 

With  Gr(p)  denoting  the  incomplete  beta  function 


the  cdf  of  Yr  ^  is  given  by  Gr(F^(y))  .  We  will  adopt  the  conven¬ 
tion  that  Gq(  •)  s  1  and  Gfi+1( •)  &  0  .  The  probability  of  coverage 

of  6  by  I  is  then 

o 


(3)  P(  6  e  x0)  -  p(yb  <  e)  -  p(Yt  <  o) 

-  P(Ye  .  <  6,  i=l, . .  .,k)  -  P( Y,  .  <  e,  1=1,... ,k) 

k  k 

•  *  G  (F .(G))  -  Jr  G  (F  (  6))  . 

i=l  S  1  i=l  t  1 

We  know  that  F^(  6)  >  a  for  i=l, ...,k  with  equality  for  at  least 
one  i  .  Hence  without  any  loss  of  generality  we  assume  that 
FjJ  9)  =  a  .  Thus  (})  becomes 
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k-1  k-J 

(4)  P{G  e  lo)  =  Gg(a)  Gfl(F1(e))  -  Gt(a)  it  G^F^e))  . 

3 .  Minimization  of  the  Probability  Coverage 

For  one-aided  random  intervals  the  minimization  over  n  of 
P(6  £  I  }  is  given  by  Theorem  1.  The  proof  of  the  theorem  follows  from 
considerations  of  ( h )  and  noting  that  Go( •)  s  1,  Gn+^(  *)  =  0  and 

a  <  F,( 6)  <1  for  each  1  .  The  details  are  omitted. 

Theorem  1 

(a)  For  s  >  0,  t  =  n  +  1,  that  is,  with  I  -  (Y  ,  ®), 

c  s 

(5)  inf  P(6  e  I  )  =  gJ(cz) ; 

ft  °  8 

arid  (b)  for  s  -  0,  t  <  n  +  1,  that  is,  with  I  =  ( -®,Y.  ), 

o  t 

(6)  inf  F(e  c  I  i  =  1  -  G.  (a)  . 

a  u 

Note  that  for  any  s  <  t,  G  ( x)  >  G,( x)  .  Therefore  in  (a)  of 

S  X 

Theorem  1,  we  choose  s  to  be  the  largest  integer  such  that  the  right 
side  of  (5)  exceeds  y  of  requirement  (l)  with  0  <  y  <  {l-(l-o:)n)^  . 
Further  in  (b)  of  Theorem  1,  we  choose  t  to  be  the  smallest  integer 
such  that  the  right  side  of  (6)  exceeds  y  with  0  <  y  <  1  -  a11  ,  it 
is  clear  from  the  above  upper  bounds  on  y  that,  for  fixed  k  and  at, 

I  can  satisfy  (1)  for  any  value  of  y  between  0  and  1,  provided  n 
is  taken  large  enough. 

Next  we  consider  the  minimization  over  ft  of  P{6  t  I  )  for  two- 
sided  random  intervals. 
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at(a)'  ai(a)  ■  Gt(a))  • 

Proof 

Since  P{8  e  I  ),  given  by  (4),  involves  F^'s  evaluated  at  6 
(constant)  and  F^(  6)  >cr  for  i=l,...,k-l,  we  can  write  F^(  0)  =  0+6^, 
where  0  <  8^  <  1  -a  .  This  enables  ua  to  reparanetriae  (1+)  as  a 
function  of  the  6^'s  .  Consequently  the  problem  of  minimization  of 
(k)  over  il  *  ((F. , . .  .,F.  ):  F^  is  continuous  for  each  i)  is  reduced 
to  its  minimization  over  {(  6.,  . . 0  <  <  1  -  Ot,  i=l,  . . . , k-l) }  . 

We  have 

k-l  k-l 

(8)  P(0  e  I  )  =  G  (a)  *  G^a+b^  -  Gt(a)  *  G^On^) 

1=1  i=l 

=  J(6i; . .  .,6^),  say  . 


Theorem  2  (two-sided  intervals) 

For  0<a<t<n  +  l> 

(7)  inf  P( ©  e  I  )  =  min{G  (a)  - 
«  0  8 


For  some  j,  fix  V  *  ,,£>J-1'  6j+l' 
Using  (2),  we  define 


6r(p)  =  ^  Gr(p)  =  r(  p)pr_1(l-p)n”r>  0  <  p  <  1, 


and  observe  that  g^(p)/g  (p)  is  increasing  in  p  for  t  >  s  . 


k 


L6~C 


k-l  k-1 

A  -  if  G.(a+&J,  B  =  «  G.(a+&,  ),  A 

1=1  81  4_i  *  i 

MJ  Ui 


B  . 


Then  from  (8)  we  obtain 


0~  AGa(Q!)6fi(«+5(j)[l 


BGt(a)gt(a+&i) 

AGjoJi^a^T 


J 


Since  gt(a+f>j)/gg(a+-&.])  is  increasing  in  6j,  it  follows  that  the 
expression  inside  the  brackets  is  decreasing  in  6.  .  Hence  we  con- 

J 

elude  that  8j/8b.  has  either  the  same  sign  at  every  value  of  6  or 

J  J 

at  most  one  change  of  sign  from  positive  to  negative  and  consequently 

inf  J  is  either  at  6.  =  0  or  at  =  1  -  Ct  ,  This  conclusion  is 
&J  0  o 

valid  for  every  other  j  .  Therefore  infimum  of  J  is  achieved  when 

a  certain  number  m  of  &  :s  are  zero  and  the  rest  equal  to  1  -  a  . 

J 

Define 


(9) 


H  = 
m 


Hm(s,t) 


0^(0)  - 


Gt+^' 


0  <  m  < 


Differentiating  with  respect  to  m,  and  noting  that  G  (ct)  >  g  (ct), 

s  t 

it  is  seen  that  8H^8m  either  has  the  same  sign  for  every  value  of 
m  or  has  at  most  one  change  of  sign  from  positive  to  negative.  Hence 


(10) 


min 

m=0,i, . .  .,k-l 


H 


m 


n^OV^.i)  • 


This  proves  the  theorem. 


5 


Note  that  for  fixed  k  and  a,  I  can  satisfy  (l)  provided  y 
lieu  between  0  and 

min  ([l-(l-a)n]r  -  a™  )  =  p(k,a,n),  say  . 
r=l,k 

Clearly,  p(k,a, n)  can  be  made  arbitrarily  close  to  1  by  talking  n 
large  enough. 

4.  An  Optimum  Tvo-Sided  Random  Interval 

For  specified  k ,d,y  and  n,  the  choice  of  integers  s  and  t, 
such  that  the  two-sided  random  interval  1  =  (Y  ,Y+)  satisfies  fl), 

may  not  be  unique  unless  some  criterion  for  an  optimum  choice  is  intro¬ 
duced.  For  the  case  k=l,  Wilks  [4]  proposed  an  optimality  criterion 
that  requires  the  ranks  s  ( of  Y  )  and  t  (of  Y ,  )  to  be  as  close 

S  X/ 

together  as  possible.  Extending  this  criterion  to  k  >  1,  we  would  be 

interested  in  choosing  s  and  t  so  that  the  rank-difference  (t-s) 

is  minimized  for  a  preassigned  y  .  For  this  purpose  we  present  the 

following  algorithm.  Let  c  be  a  positive  integer  less  than  n  and 

consider  I  =  (Y  ,Y  ,  )  .  Denote  by  Q(s,c)  the  infimum  of 
o  s  s+c 

P{Y  <  6  <  Y  )  over  Q  as  given  by  (7).  For  every  fixed  c,  let 
s  s+c 

s  ( c)  be  that  value  of  s  for  which 
o 

Q(s  (c),c)  ^  max  Q(s,c)  . 
l<s<n-c 

Now  choose  the  smallest  c,  call  it  cq,  such  that 
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^8o(co)'co)  ^  y  ■ 

Then  the  optimum  choice  of  the  random  interval  satisfying  the  require¬ 
ment  (1)  is  (Y  ,Y  )  with  c  =  c  and  s  =  a  (c  )  . 

B  B  +  C  O  O  O 

For  moderate  values  of  n,  say  n  <  50,  the  above  algorithm  can 
be  carried  out  in  an  easy  manner  using  the  incomplete  beta  function 
tables  or  the  more  readily  available  binomial  tables  for  smaller  values 
of  n  in  view  of  (2).  For  exaiqple,  when  k=4,  a=o.5,  7=0.90,  and 
n=25,  we  obtain  s =8  and  t-17  .  In  this  illustration,  it  is  inter¬ 
esting  to  note  that  even  for  k=l,2,3  and  the  same  values  of  a, 7  and 
n,  we  obtain  s-6  and  t=17  • 

5.  Large  Sample  Approximations 

For  large  n,  using  normal  approximation  to  binomial  in  (2),  with 
1( •)  denoting  the  standard  normal  cdf,  we  obtain 

(11)  Gr(a)  *  ^(-r+naVCnad-a))1/2)  . 

For  one-sided  intervals  of  Theorem  1,  using  (11),  in  the  case  (a)  we 
take  s  to  be  the  largest  integer  such  that 

s  <  na  +  (na(i-a))1/2o'1(i-71//k), 

and  in  the  case  (b)  we  take  t  to  be  the  smallest  integer  such  that 

t  >  na  +  (n^(l-a) ) 1/^2  7)  . 
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Fur  the  twc-sidei  optimum  random  interval  described  in  Section  4, 


using  ( .1  i ) ,  we  have 


Q(s,o)  «  mint#  d-n)  -  T(  -x  )  .  Ik(  d.- j<  !  -  dN-x))  , 


.  1  /  O  l  / 

ri  c(  ncri  1  •  -J ) ;  '  .  x  •  1  nu.  s  j  (  nU(  1  .a )  )  /  c 


pwiewlcz  and  Tong  [i  !  show  that  for  any  given  d,  the  vaiue  of  x, 

sty  xo,  thi>  maximizes  the  rigid  side  of  (Id)  is  ei’her  the  vaiue  for 

which  the  two  terms  within  braces  of  (12)  ar-  equal  cr  the  vaiue  which 

maximizes  the  second  of  these  two  terms.  Tabl-  i  of  1  1  I  gives  xq  for 

k  el  Min  /  lu  and  d  1(0. 1,'"  For  k  1  .  obviously  x^  -  d/?,  also 

f.  r  k  2  ,  x  -  -  as  shown  in  111  Thus  for  k  1,-r  and  n  iarge, 

f-.-m  ']'  i’  is  se-p  'hat  s  and  '  will  be  equidistant.  from  net  on 

- 1 1 1'-5  r  side.  Tabie  ?  of  |1|  gives  coverage  pr-  babil.i'y  (12)  evaluated 

a*  r.f  Tatil-  1.  We  consider  once  again  th<-  -sample  of  Section 

'■  ill  us  >  rat  o  *.h-  us-  cl'  the  tables  of  Ul  fer  adapting  our  algorithm 

1  o  lire.-  sampie  -sizes,  For  k  a'-o.'".  y-.C-OO,  and  n V*-.,  Table  2 

g’.  -?s  do. 4  .  '.'sing  d  -*• .«  we  obtain  from  Table  i,  x  1 . 4‘2 .  Wow 

o 

from  (  1  •’<  i  with  x  x,  •  we  obtain  s  °-1-  ;  and  c  l'.  -  .  These  values 

f  s  and  c  are  1 1-  ti  'ha  optimum  values  s^(  e\  :  and  c  respectively 

of  Sr-  ion  4,  Since  t  he  Co'-imum  values  have  tc  be  integers,  we  round 
■  hr  m  -.ff  as  si  cn !  •  1  c.  v  .  Thus  in  this  example  we  obtain 

'he  random  interval  ob'ained  previously, 

I'he  g.V'in-ss  of  t|,,-  iarge  sample  app  <•-••>  ims»t  ion  considered  above  is 
dir-c'ly  twla'-j  t .•  the  well  known  convergence  of  binomial  to  normal 


Fcr  commonly  used  values  of  o.  it  is  felt  that  this  approximation  is 

adequa'e  for  sample  sizes  larger  than  >0 

r r>  conclusion  it.  should  be  pointed  out  that  the  problem  of  interval 

estimation  of  the  smallest  ot-quam  I  le  O'  min  x  (  F.  '  can  be  handled 

1<  i<  K  1 

in  b  manner  analogous  to  '■.he  discussion  of  this  paper  by  considering  tne 

r-wrinm  in’er"'il  (Y',Y,  .  s  <-  »  ,  where  Y  -  min  Y 

s’  r  ,  .  ,  r ,  i 

1<  i<  k 
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